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Abstract 

We consider a dynamic metapopulation involving one large population of size N surrounded 
by colonies of size e N N, usually called peripheral isolates in ecology, where N — > oo and 
e N — > in such a way that e N N — > oo. The main population periodically sends propagules to 
found new colonies (emigration), and each colony eventually merges with the main population 
(fusion). Our aim is to study the genealogical history of a finite number of lineages sampled at 
stationarity in such a metapopulation. 

We make assumptions on model parameters ensuring that the total outer population has 
size of the order of N and that each colony has a lifetime of the same order. We prove that 
under these assumptions, the scaling limit of the genealogical process of a finite sample is a 
censored coalescent where each lineage can be in one of two states: an inner lineage (belonging 
to the main population) or an outer lineage (belonging to some peripheral isolate). Lineages 
change state at constant rate and inner lineages (only) coalesce at constant rate per pair. 

This two-state censored coalescent is also shown to converge weakly, as the landscape dy- 
namics accelerate, to a time-changed Kingman coalescent. 

Running head. The peripatric coalescent. 

Key words and phrases. Censored coalescent; metapopulation; weak convergence; peripheral iso- 
late; population genetics; peripatric speciation; phylogeny. 
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1 Introduction 

Many plant and animal populations in nature are highly fragmented, and this fragmentation plays 
a prominent role in the context of adaptation and speciation. Indeed, the emergence of new species 
is usually thought to be driven by geographical processes [5J. First, allopatric speciation occurs 
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when various subpopulations belonging to the same initial species are separated by a geographical 
barrier that prevents hybridization between them (gene flow) and allows them to diverge (genetical 
differentiation) by local adaptation. Second, parapatric speciation is a version of allopatric speci- 
ation where local adaptation is mediated by the existence of an environmental gradient (resource 
availability, environmental conditions). Third, when a species is present in one large, panmictic 
population surrounded by small colonies, usually called peripheral isolates, it is believed that the 
combination of founder events and of local adaptation to borderline environmental conditions leads 
to the formation of new species within the isolates. This phenomenon is called peripatric speciation. 
We aim to study the genealogy of populations embedded in such a spatial context. The present 
study should then serve as a building brick for future work in the field of speciation modeling. 

Population dynamic models specifying explicitly the spatial context are called metapopulation 
models (Hanski and Gilpin [8]). Typical such models include: island model, isolation by distance, 
stepping stone models, extinction-recolonization models. From the point of view of speciation, all 
these models suffer from the same defect: they assume a given, constant number of subpopulations 
in the metapopulation, with fixed migration rates between them. As one of the authors of the 
present paper suggested (Lambert [13]), an alternative method would consist in considering species 
as "spread out on a randomly evolving number of locations, allowing for repeated fragmentations 
of colonies, colonizations of new locations, as well as secondary contacts between subpopulations". 
This author and others have designed such dynamic landscape models [TU [21 [3], but usually in 
detailed ecological contexts whose study is only possible through numerical simulations (to the 
exception of [1]). 

Here we propose a mathematical study of a dynamic landscape of the peripatric type. More 
specifically, we consider a dynamic population subdivision which involves one large main population 
surrounded by a random number of small peripheral isolates, that we will call colonies for simplicity. 
The size of the main population is constant equal to N, the size of each colony is constant equal 
to e N and the genealogy in each population is given by the Moran model. The number of colonies 
at time t is denoted by ^iv(t). The landscape dynamics is as follows (see Figure 1): 

• At constant rate 9n, each individual sends independently e N N offspring to found a new 
colony; 

• Each colony independently merges again with the main population at rate Jn^n -1 , where 
a > 1; at such so-called fusion time, £ N N individuals among the new (l+e N )N individuals of 
the main population are chosen uniformly and simultaneously killed to keep its size constant. 

Note that (£jv(i);i > 0) is a pure-death process with immigration. The parameter a is meant 
to model the competition for space, since the fusion rate per colony grows with the number of 
colonies. This density-dependence disappears if a is chosen equal to 1. 

The main purpose of this paper is to investigate the genealogy of a finite sample of lineages 
in the above peripatric metapopulation model. We will show that the history of such a sample, 
viewed backward in time, can be approximated, as N — > oo under certain assumptions, by a two- 
state censored coalescent, where the state of a lineage can be inner (lying in the main population) 
or outer (lying in a colony). Lineages change state at a constant rate per lineage, but only inner 
lineages can coalesce, at a constant rate per pair of lineages, as in Kingman coalescent [12] . 

A two-state censored coalescent can be viewed as a new type of structured coalescent. The 
structured coalescent (see Takahata [IB] . Notohara [15] and Herbots [9]) describes the ancestral 
genealogical process of a sample of lineages in a subdivided population connected by migration. 
The coalescent on two subpopulations was considered by [16] ; for a finite number of subpopulations 
by [IS], and placed in a rigorous framework by [9]. To date, there have been a number of works 
dealing with the structured coalescent arising in various special types of metapopulations; see 
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Nordborg and Krone [T3], Eldon [6] and the references therein. Our results show that new types 
of structured coalescents can arise in some specific dynamic metapopulations. 

We now give the heuristics giving rise to the result. We assume that N — > oo and e N — > in such 
a way that £ N N — > oo, so that the size of colonies is large but neglectable compared to the main 
population (Assumption A). It is known that in a Moran model, inner lineages coalesce at constant 
rate per pair when time is rescaled by N (Kingman coalescent [12.]). We make assumptions on 
the parameters ensuring that all events changing the configuration of ancestral lineages occur on 
this time scale. This can only be done to the exception of coalescences in colonies, which happen 
instantaneously in the new time scale, leading to outer lineages which always all lie in different 
colonies. Also, in order to have a total outer population size of the order of JV, we need to have a 
number of colonies of the order of e~ . This can be achieved by the following choice of parameters 
(Assumption B). The per capita emigration rate On is taken equal to 



On 

and the fusion rate 7 at is taken equal to 



e JV 2 ' 



£ a-l 

7iV = 7" 



JV 

Under theses assumptions, the number of colonies is asymptotically deterministic, equal to e" 1 (0/7) 1/0 • 

Now the rate at which a single inner lineage changes state is the rate at which a single lineage 
is taken in a fusion event (backward in time), which happens at rate 

which is equivalent to 0/N as N — > 00. As a consequence, in the new time scale, inner lineages 
become outer lineages at contant rate 0. 

Also note that the probability that two lineages are taken in the same fusion vanishes, so that 
no two lineages can lie within the same colony. As a consequence, outer lineages are not allowed 
to coalesce. 

Now the lifetime of a colony is approximately exponential with parameter 

/ 0\ 1-1/0 

7ivCjv ~in£ n y-j 

which is equivalent to (0/N) (j/O) 1 ^. As a consequence, in the new time scale, outer lineages 
become inner lineages at contant rate (7/$) 1/a - By making these heuristics rigorous we get the 
results stated in Theorem 13.11 Namely, the genealogical history of a finite sample of lineages, 
seen as a process backward in time, converges weakly (except at time 0, where instantaneous 
coalescences within colonies makes the limiting process not right-continuous) to the following two- 
state censored coalescent. Inner lineages coalesce at constant rate 1 per pair, and lineages change 
type at constant rate per lineage: inner lineages become outer lineages at rate and outer lineages 
become inner lineages at rate (^ /0) l / a . 

The paper is organized as follows. In Section 2, we give a detailed description of our dynamic 
metapopulation model in forward and backward time. The main result, Theorem 13.11 is stated in 
Section 3. In addition, we also prove that under fast landscape dynamics, the censored coalescent 
converges weakly to a time-changed version of the Kingman coalescent [T2]. Finally, a section is 
dedicated to the formal proofs of the above results. 
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2 Metapopulation model 



2.1 Forward dynamics 

Let N G N with N := {0, 1, 2, • • • } and let e N be any positive number such that £ N N G N. Let 
6n, In an d a be positive constants. Consider a dynamic metapopulation model involving one 
large population of size N, called main population, and a random number of small populations, 
called colonies, of size £ N N. The main population periodically sends propagules (or emigrants) 
that found new colonies and ultimately each colony merges again with the main population. A 
further assumption is as follows. See Figure 1 for an illustration. 

(1) The number of colonies, denoted by {£/v(i) : t > 0}, evolves as a pure death density-dependent 
process with immigration and the transition rates are given by 

+ 1 at rate N6 N , . _ 

j — > j — 1 at rate 77vj°. 

When a = 1, the process {6v(£)} is reduced to a pure death branching process with immigration. 
It follows from Kelly [10] that {£jv(i)} with any initial value has the stationary distribution ir^ 
given by 

w (m (, | f (^W r 1 , T m {NQ N / lN ) k ( ^ (N8 N / lN y yi 

W>) = (l + 2^ (Jfp J andvr^(A ; ) = — (1 + ^ — j (2.2) 

for > 1. We assume that £jv(0) is distributed as tt^. Then {£jv(i)} is a stationary Markov chain. 
Let (P/ v )f>o be its semigroup. For any finite set {£i < <2 < • • • < t n } C R define the probability 
measure on N by 

^,t 2 ,-,t„(il>i2, • • • , jn) = ^(jliPh-hUuh) ■ ■ ■ P tl-t n - X {jn-l,3n)- (2-3) 

Then {Vti t2 — t n : ^i < ^2 <•••<£■« £ R} is a consistent family. By Kolmogorov's theorem, there 
is a stochastic process {£./v(t) : i G R} with finite-dimensional distributions given by (j2.3|) . Clearly, 
{&v(i) : i G R} is a stationary Markov chain with one-dimensional marginal distribution ttn and 
transition semigroup (P t N ) t >o- 

(2) At the jump times of £jv(i) from j to j + 1, one individual, chosen uniformly at random from 
the large population, gives birth to e^^V emigrant offspring individuals which found a new colony. 
We refer to such an event as "emigration" (of new colonies) or "fission" . 

(3) At the jump times of £jv(£) from j to j — 1, one colony is chosen at random from the j current 
colonies and all the £ N N individuals within this colony immediately migrate back into the main 
population. We refer to such an event as a "fusion" (of colonies with the main population). Instead 
of keeping all those (1 + £ N )N individuals in the main population alive, only N of them survive 
this fusion event, which are chosen uniformly at random among the (l + e N )N previously existing 
individuals. 

(4) Between the jump times of £jv(£)i the large population and the colonies independently evolve 
as Moran models, that is, at rate 1 each individual independently gives birth to a single offspring, 
and simultaneously a uniformly chosen individual is killed. 
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fission (fusion seen backward) 




=3^ 



Figure 1 : Dynamic metapopulation 



2.2 Backward dynamics 

Now we start with a sample of n lineages at time and proceed backward in time. Let Xjv(t) = 
(.Xjy(i), Xlf(t), ■ ■ ■ , Xpj(t)) be the ancestral process of this sample defined for t > by 

Xff(t) = the number of lineages in the main population at time — t, 

X l N {t) = the number of colonies containing i lineages at time —t (1 < i < n). 

We set Xat(O) = x, where x = (xq,xi,--- ,x n ) G N ra+1 with xq + YTi=\3 x j = n - ^he process 
{Xjv(i) : t > 0} has state-space 

n 

E := {(x ,X!,--- ,x n ) G N n+1 : 1 < xo + ^jxj < n}. 

i=i 

Define the subspace II of E 1 by 

n := {(x ,xi,0,--- ,0) G N n+1 : 1 < x + x 1 < n). 

Consider the projection V : (xq, x\, 0, • • • , 0) >->■ (xq, x\) from II to N 2 . 

r(n) = {(x ,xi) G N 2 : 1 < x +xi < n). 

By the action of the homeomorphism T, T(Il) cap be regarded as a subspace of E, and we thus 
still denote it by IT for simplicity. For x G E, let 

n 

x := (x ,J2 x j)- 

3=1 

We will use this notation for the following reason. Because in the new time scale lineages lying in the 
same colony immediately coalesce, the configuration x immediately turns into (xq, Y0j=i Xj,0, . . . ,0) 
where all outer lineages are now alone in their respective colonies. Note that x >->■ x is an injection 



5 



from E to II. We also write ej = (0, . . . , 0, 1, 0, . . . , 0) G N™ +1 whose (j + l)-th component is 1 for 
j = 0, . . . , n. 

Let 7] N (t) = £jv(-i) for t > 0. It follows from [10. Lemma 1.5, P.9] that {r] N (t) : t > 0} is still a 
stationary Markov process with the same transition rates as f)2. 1[) . Thus, the fission events (fusions 
seen backward) happen at rate 6nN and, conditioned on 7?jv(i), the fusion events (fissions seen 
backward) happen at rate JnVn^)- At any fission time, every lineage independently exits from the 
main population with probability e N /(l + e N ). At any fusion time, one colony is chosen at random 
from the existing colonies and the (say) i lineages in this colony enter the main population, and 
simultaneously coalesce together (if i > 2), and coalesce with their ancestor in the main population 
(if it is also in the sample; but asymptotically, with high probability i = 1 and the ancestor is 
not in the sample). Between fission and fusion times, coalescences within the main population or 
within colonies may happen. We again refer to Figure 1 for an illustration. 

Based on the above description, it is not hard to see that {(Xjv(i), J?jv(*)) : t > 0} is a time- 
homogeneous Markov chain taking values in E x R + . The corresponding generator is given by 

A N g(x, k) = tp N g{^, k) + 4>n9(x, k) + f N g(x, k) (2.4) 

for any bounded function g on E x N. Here 

n ( \ 2 

ip N g(x,k) = X^jU L jv_i ( g ( x ~ e i + e i- 1 ' /c ) ~ f( x > fe ) 

j=2 

which corresponds to coalescences within colonies. Note that ipNg(x,u) = if x E II. Then 

j) N g{x,k) = ( X 2 °) at - 1 ( g< - X ~ 61 ' ^ ~ 9<kX ' ^) 

x ° / \ i — 

+ OT *E (?) (ttt;) ( 9(x " re » + e ^ + " - **■*>) 

+ 7at/c q (xi/A;)(1 - (x /N))(g(x -e 1 + e ,k-l)- g(x,k)^jl {k>0} 

N 

+ 7iV& Q ^2(xj/k)(l - (xq/N)) (g(x - e,- + e 0) fc - 1) - p(x, fc)) l {fc>0 } 
i=2 

+ 77vA: Q ^(x,/A ; )(xo/iV)( p(x - ej,k — 1) — #(x, fc)) l{ fc>0 }. 

In 0jv 5 the first term corresponds to coalescences within the main population, the second term 
corresponds to exit from the main population, the third term corresponds to entrance into the 
main population, the last two terms correspond to simultaneous entrance into the main population 
and coalescence. The fourth term is identically equal to if x G II. Last, 

f N g(x,k) = Ar^(-J_)" ^(x,fc + l)-r 7 (x,A;)) 

N 

+'j N k a (l -^Xj/k^j (g{x,k- 1) - g(x,k)\l {k>0 y, 
3=1 

which corresponds to the event that the number of colonies increases or decreases but the ancestral 
process does not change. 
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3 Convergence to the two-state censored coalescent 



3.1 Main results 

Let D([0, oo), S) be the space of all cadlag functions x : [0,oo) — > S endowed with the Skorokhod 
topology for any separable and complete metric space S; see Ethier and Kurtz [TJ p. 116] for details. 
For N £ N, we consider the sequence of processes {(Xjv(-)> f]N (•))}• Define 

Y N (t) = Xjv(iVi) and fj N (t) = £ N w{Nt). 

Let 9 > and 7 > be constants. We further assume the following conditions: 



(A) e = e N satisfying e N — > and £ N N — > 00 as N — > 00; 

(B) 9 N = 9/(e N N 2 ) and lN = ^e a ' l /N. 

Recall that y E E and the corresponding y £ LT. The main result of the paper follows. 

Theorem 3.1 Let x € E. Under conditions (A) and (B), the finite- dimensional distributions of 
the ancestral process {Ytv(£), t > 0} starting at y converges to those of a H-valued continuous 
time Markov chain {Y(t), t > 0} starting at y, except at time 0. The corresponding infinitesimal 
generator Q = (<?r,r')r,r'en is given by 



Qr,r' 



( -(ero + e^/ey^n + roiro 

0r , 

r (r - 1), 
0, 



1)), */V = r, 

if ro 7^ and r' = r + (—1, 1), 
if r i 7^ an d r' = r + (1, —1), 
z/r' = r+(-l,0), 
otherwise. 



(3.1) 



where r = (ro,ri) G II. Furthermore, if the initial value y G LT, weak convergence on D([0,oo),IT) 
to {Y(t)} holds. 



The previous statement describes the asymptotic genealogical history of a finite sample of lin- 
eages, seen as a process backward in time. Except at time 0, where instantaneous coalescences 
within colonies makes the limiting process not right-continuous, this process converges weakly to a 
two-state censored coalescent, where type corresponds to inner lineages (lying in the main pop- 
ulation) and type 1 to outer lineages (lying in pairwise distinct colonies). Inner lineages coalesce 
at constant rate 1 per (ordered) pair, and lineages change type at constant rate per lineage: inner 
lineages become outer lineages at rate 6 and outer lineages become inner lineages at rate 6 (j/O) 1 ^. 

Now consider a sequence of censored coalescent processes {Yk(t)} defined by (|3.ip with parame- 
ters 9 and 7 replaced by 9k and 7^, and the initial value Yfc(0) = y E LT with yo + 2/1 = n. Let 
Y k (t) = Y k °(t) + Y^t) and let I n = {0, 1, 2, • • • , n}. We assume that 



(C) As k — > 00, 9 k — > 00, 7^ — > 00 and 9k/~/k — > P for some constant p > 0. 

The above condition corresponds to the acceleration of the landscape dynamics (emigration and 
fusion) . The following theorem states that such an acceleration gives rise to a single state coalescent 
process, where coalescence rates are obtained by averaging over the probability of presence in the 
main population. 
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Theorem 3.2 Under condition (C), the process {Y&(i), t > 0} starting at n converges weakly to 
the time-changed n-Kingman coalescent process {K(t), t > 0} on D([0, oo), I n ). When K = I, the 
coalescence rate is given by 



V- •/■ ^fl\{ p 1/a W 1 



Remark 3.3 It is easy to see that if p = which corresponds to predominant emigrations, 
{Yfc(t), t > 0} converges weakly to the constant process {K(t) = n,t > 0}; if p = oo which 
corresponds to predominant fusions, t > 0} converges weakly to the standard Kingman 

coalescent {K(t),t > 0} (i.e., c\ = 1(1 — 1)). 

3.2 Proofs 

To prove Theorem 13 .1\ we start by proving the following lemmas. 
Lemma 3.4 Under conditions (A) and (B), as N — > oo, 

wKO (^/7) 1/Q 

in D([0,oo),R+). 

Proof. Recall that the number ^(Nt) of colonies of size £ N N is a pure death density-dependent 
process with immigration with transition rates given by 

j — > j + 1 at rate 6/s N , 
j — > j — 1 at rate 7£" _1 j Q - 

By (|2.2|) . it has the stationary distribution 7Tat given by 

W(7^))'\-l , ,„ («/(7C)) t / . ^.(«/(7C)F\-l 



Recall that £jv(0) is distributed as tin- Let = [(0/7) ™/e n \. For A; > Mat, 
ir N (k) 

A g/vK) / (wr ffi g A t4 g/(7^) V 

j q v(0/(7^))^ + ft i-ii^ e ") < a / 



j=Mjv+l '"• v ' j=l i=i+l ' w w ' j=Mjv+1 j=A/jv+1 

(3M N ) 



9/(-ye a ) 9 /(-ye 01 1 i 

Note that if j > M N , j a > 6/(76%) and if j > 3Mat, /v . a N ' < '\l N ' < 4 for sufficiently large 



iV. Then IlJSk+i < 2" qM " and 



^ it < yi /V ;" ; ^2-" fc <0(2- 

fc=4Mjv+l j=Afjv+l ^ i=M]v+l ^ fe=l 



■oMjv N 
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If follows that 7tjv([4Mjv,oo)) < 0{2~ aMN ). Thus the sequence {£^(0)} is tight. On the other 
hand, {e N %N(Nt)} takes values in {ie N : i 6 N} and its generator is given by 



L N f(z) = je a - l (z/e N ) a (f(z - e N ) - f(z)) + (9/e N )(f(z + e N )- f(z)), 

for any continuous bounded function / on M + . Let C 2 (R+) be the set of twice differentiable 
functions with compact support on M+. It is not hard to see that as N — > oo, for / G C 2 (M+), 

\\L N f - Lf\\ ->• and Lf(z) = {6- 1Z a )f'(z), (3.2) 

where ||/|| = sup I . g]K+ |/(x)|. The Markov process £ with generator L is actually deterministic and 
satisfies the ODE: 

e'w = ^-7r(t), 

which has the unique equilibrium point (#/7) 1//a - It follows from (|3.2p . Theorem 6.1, P.28] and 
Theorem 9.10, P.244] that £^6/(0) (#/7) 1/a as iV -> oo. Again by ([33]) . [7, Corollary 8.7, 
p. 231] shows that {£iv(t) : i > 0} converges weakly to the constant function {£(£) = (^/t) 1 ^", i > 
0} on L>([0, oo), R + ). Since £jv(-) is stationary and time-reversible, the same weak convergence 
holds for ?7jv(")- Then this lemma is proved. □ 

As in Section 2 it is easy to see that (Yjy-(-), ?7jv(")) * s a continuous time Markov chain taking 
values in x K + . Based on (|2.4[) and Conditions (A) and (B), a simple calculation shows that the 
corresponding generator is given by 

An9(y, u) = ^iVff(y, u) + 4>Ng(y, u) + rjv5f(y, u) (3.3) 

for any bounded function g on M + x E. Here 

n / \ 1 1 

Vw(y, u) = 2 ^ y 3 - f 3 1 — n + , ) (s(y - e i + e i-i, «) - 5 (y, «) 

Note that 1/(6^ N — 1) — >■ as N — > oo by Condition (A). We also have 
4>Ng(y, u) = 2 (^^j (g(y - e , u) - g(y, ufj 

+ dyo( K g(y -e + e 1 ,u + e N ) -g(y,u) 
+ iu a ~ 1 yi(g(y -e 1 + e ,u-e N ) - g(y,u))l {u>0} 



N 

+ lu^ 1 ^2 Vj (a(y -ej + e ,u-e N )- g(y, u)j l {u>0 } 

+ (e N R 1>N g(y,u) + —u a ' 1 l {u>0 yR 2t Ng(y,ufj. 

Here the fourth term is identically equal to if y G II. In the last term, R\ t N and R2,n ar e 
bounded linear operators satisfying ||-Ri,jv|| ^ C f° r some constant C. This last term includes the 
simultaneous entrance into the main population and coalescence and it vanishes if c\ < u < C2 for 
positive numbers c\ and cq. Last, we have 

^Ng(y,u) = Oe^il - y s N )(g(y,u + e N ) - g(y,u)j 

N 

+^ u ° !£ n 1 i 1 ~ £ n u ~ 1 y j) u ~ £ n)- g(y> u )) 1 {u>o}- 

3=1 
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Let us write c^f(y) (resp. (y,u),c^ (y,u)) the total rate of the events generated by ipN (resp. 
4>N, Tjy) when Aj^g is applied to (y,u). Then 



^ e N N-U' 



( \ N 
4 (y, «) = 2 ( ^ ) + 9y + ju a -H {u>0} £ Vj + e N (l + u a ~ l l {u>0} ) 



and 



AT 

Let us introduce the following notation, 

= inf{t > : Yjv(t) € n} 

and 

cr^ = inf{t > : a 07v-event occurs at t }. 
Lemma 3.5 c,^ — > as JV-> oo. 

Proof. By Lemma 1.3, we have for any T and < 5 < (#/7) 1//o , as iV — >■ oo, 

P( sup \fj N (t) - {9hf/ a \ >5)^0. (3.4) 

v 0<t<T ' 

Fix above 5. Let d = {6/j) 1/a -5 and c 2 = (9/j) 1/a + 5. Conditioned on (Y N (t),rj N (t)) = (y,u) 
with (y, u) £ (E\H) x [ci,C2] at the current time t, 

c t y u ) 

P(the next event is a 6 N -event) = , r ; ; ; in- < Ce„, 

pf(y) + <(y,«) + c^(y,«) ~ 

for some positive constant C; 

P(the next event is a tp N -event) = N N ^ ^— jt- < 



< (y) + (y, «) + c^(y, u)-2n* + e + 7 c? ' 
for sufficiently large iV; 

• yu CrY(y, n) 6* + 7C9 

P the next event is a T -event = , , : tt- < -, 

<(y) + cj r (y,n) + cf(y,y) - 2 + + 7^' 

for sufficiently large N. Inspired by Taylor and Veber |17l Lemma 3.1], we fix some s > and 
consider 

P(<r$>s)=P(D) + o(l), 

where 

D = {a$ > s, sup \fj N (t) - (0/ 7 ) 1/Q | < S}. 

0<t<s 
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Then 

P(-D) = P({at most n ^ N -events occur in[0, s]} n D) 

= P({at most n ip - and at least a ^-events occur in[0, s]} n D) 

+ P({at most n if) N - and no (^-events occur in[0, s]} n D) 
= : /1+/2. 

Note that we have Y N (t) G E 1 \ IT for t G [0, s] if cr^ > s. Let p = 2w3 +g+ 7C a V gjjp^. Then 

n 

Ii < P ({exactly k tp N -events before a 4> N -event occur in [0, s}} n D) 

k=0 

n 00 

= P({exactly k ip N - and Z T^-events before a ^-event in [0, s]} n I?) 



fc=o z=o 

n 00 



<EE : r w (3 - 5) 

fc=0 1=0 v 7 

Since < p < 1, X^fc=o Yli=o ^~k l )p k+l < 00 • Then l\ — > as N — > 00. Let Uj 1 be the arrival time 
of the j'th event occurring to (Yjv^iv)- For I2, 



I2 = P({ exac tly k ip N -events, I T^-events and no ^-events occur in [0, s]} n D) 

k=0 1=0 

1.— n ;— n \ / 



Conditioned on (Yjv(i), fjN(t)) = (y,u) with y 6 (E\ II), the rate for the event occurring to 
(Yjv, jjjjv) at time t is c^f(y) + c^ (y,u) + Cp (y,it) and c^f (y) > 2/e N . Then Uj? +l+1 is stochastically 
bounded by the sum of i.i.d. exponential variables with parameter 2/e N whose distribution 

becomes concentrated close to as N — > 00. Thus as N — > 00, P({f7^ < s, U^ +l+1 > s}C\D) — > 
and by the dominated convergence theorem, I2 — > 0. □ 

Lemma 3.6 There exist positive constants M and K\ such that for any s > 0, 

limsupP(af < s) < M(l - e~ KlS ). 

iV->00 

Proof. By the proof of (|3.5p . P(at least one (^Ar-event occurs before cr^) — )• as N — > 00. Then 
by (|3.4p . we have 

P(of <fl) = P(G) + o(l), 

where 

G = {only t/>tv- or r^r- events before crff , sup |7?jv(£) — {& ] H) l ^ a \ < ^ an d o"^ < s}. 

0<t<s 

Recall that Yat(0) = y. If only i/j^- or T^- events occur before <Jq , (Tq < and Yjv(i) = y for 
t £ [cr^erf]. Furthermore Y/v(i) E II and c^f = for i > cr^. Conditioned on (Y N \t),fj N (t)) = 
(y,u) with (y,u) G LT x [ci,c 2 ], 

#1 < ^ (y, «) < K 2 , e-\e + 7 cf )/2 < c? (y, u) < ^(fl + 7c?), 
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for sufficiently large N, where K x = ^(c^ 1 A c^ -1 )] A 9 and K 2 = n 2 + n9 + 7n(<£ -1 V cJT 1 ). 
Then 

c ?(y>«) 2ET 2£jV c^(y,n) < + 7 c« 



c"(y, u) + c*(y, u) " 2^ 2£jv + 9 + 7 c? ' c ^ (y, «) + c f (y, u) ~ K i£n + + 7 cf " 



For (Yjv(-)) Wn(')) with initial value (y, u) G II X [ci, c 2 ], recall that £7^ denotes the arrival time of 
the j'th event occurring to (Ytv,??7v) and Uq = 0. It is not hard to see that is stochastically 
larger than the sum of j i.i.d. exponential variables with parameter £^{9 + 7 c 2 ) + K 2 . We have 

oo 

P(G) = ^P({exactly A; r^-events occur in [<rtf,<T?]} D G) 

fc=0 

A / g + 7£| \ * 2K 2 ^ / N N x 



<MP(^ + ^f < s ) 



for some positive constant M and sufficiently large N, where {Vf^} are i.i.d. exponential variables 



with parameter £ N l {9 + 7 c 2 ) + -K" 2 , and T\r is a geometric variable with parameter K ^ g x + g + ^ ca 

independent of {Vj N }- Since (Jq A 0, a simple calculation shows that + Ylj^o converges 
weakly to an exponential variable with parameter K\. The lemma is proved. □ 



Lemma 3.7 Under conditions (A) and (B), the ancestral process {Yjy(t), t > 0} starting at y 
with y E II converges weakly on D([0, oo),II) to {Y(i), t > 0} given by \3.1\) starting at y. 

Proof. If the process 3^v(t) stays in the space of II, ipN and the fourth term in <f>^ vanishes. Then 
for any bounded function g on E x M + define -Bat<? = 4>n9 + r^g, where Tj^ is given in (j3.3|) and 

4>N9(y, u) = 2 (^^j (g(y - e , u) - g(y, u) ^ 

+ 8yo( K g(y- eo + e 1 ,u + £ N ) -g(y,u) 

+ iu a ~ 1 yi(g{y - ei + e , u - e N ) - g(y, u) J 1 {m>0 } 
1 



£ N Ri,Ng{y, u) + —u a i{ u>0 }R2,Ng(y, 



u 



Let Ft = {(Yat(s), fjisr(s)) : < s < t}. Because of the Markov property of (Yjv(-)> Vn(-)), 



g(Y N (t),fj N (t))-g(y,fj N (0))- / (B N g)(Y N (s),fj N (s))ds 



o 



is a local (J r / v )-martingale. Let 

t n = inf{t > 0, |^(t-) - (0/ 7 ) 1/a | > <J or |^(t) - (#/ 7 ) 1/q | > 5}, 
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where 5 is a positive constant satisfying < 5 < {0/^) l l a . For sufficiently large N, there exist 
cy > and C2 > such that C\ < fjj^(t A tat) < C2 for any t > 0. Then for any function / on E, 

HAt n 

( N (t) = f(Y N (tAT N ))-f(y)- / (B N f){Y N (s AT N ),fj N (s Ar N ))ds 

Jo 

is a bounded martingale. Indeed, for sufficiently large N, \(Bj^f)(Yj^(s A rjv), ??Af(s A rjv))| < 
2(n 2 + n6> + ^(c" -1 V c^ -1 )) ||/|| for any s > 0. Note that E 1 is a finite set, so the discrete 
topology on E makes it a complete and compact metric space and any real valued function / on 
E is bounded and continuous. By Ethier and Kurtz jTJ Theorem 9.1 and 9.4, p. 142], the process 
{Y/v(i A tjv)} is relatively compact. On the other hand, for any t > 0, 

P(ttv < t) < P( sup \fj N (s) - (9h) l ' a \ > 5) 0, 

v 0<s<t 7 

which shows that tat — )■ oo. Let {Y(i)} be any limit point of {YAr(i A tat)}. By Skorokhod's 
representation theorem we may assume that on some Skorokhod space (tat, Yj^(t A TN),fjj^(t A 
tat),) a 4' (oo,Y(i),(0/ 7 ) 1/a ) in the topology of M+ x D([0,oo),E x M + ). Thus Civ(i) ^4" C(*) and 
is given by 

C(t) = /(Y(i)) - /(x) - [\Bf)(Y(s))ds, (3.6) 

•/ o 

where 

y) = 2 M (/(y - e ) - /(y)) + 0y o (/(y - e + ex) - /(y)) 
+ #( 7 /0) 1/a yi (/(y - ex + e ) - /(y)) . 

Since sup^r |Cjv(*)I < Gv(*) -4 C(*) fo r an y t > 0. Thus is a martingale. Since {Y(t)} 
given by f)3. 1 1) is the unique solution to the martingale problem (|3.6p . We have that {Yat(£ A tn)} 
converges weakly to {Y(i)} given by (|3.1j) in D([0, oo), E). Furthermore, for any e > and any 
t > 0, 

P( sup \Y n (sAt n )-Y n (s)\ > e) < P(t n < t) -+ 0, 

v 0<s<t 7 

as N — > oo. The lemma follows from the above limit. □ 



Proof of Theorem \3.1\ Let Py (•) be the distribution of (Yat(-)j Vn{')) with initial value (y, 7^(0)), 
where r/7v( - ) is distributed as ttn given in Section 2. Let /i, • • ■ , be real-valued functions on E*. 
Choose < s < ti < ■ ■ ■ < tk- Let Qn = {cr^ < s < a^}. Then 



E, 



E, 



E, 



E, 



Ut 1 fi(Y N (t l ))l Q[ 



1 OAr E (y,wW) 



E 



n^/^YAr^))!^ 

nJU/iCMii - s )) 

nti/iO^fe - s)) 



(y,VN(s)) 



I$ =1 fi(Y N (ti - s)) 



E, 



1 Qiv E ((y,^( s )) 



nti/i(Y2v(ti - *)) 



■E, 



1 <5iv E (y^iv(s)) 



Dti/iCYjvCti - s)) 
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where Qm is the complement of the set Qn- The last equality follows from the fact that ?7jv(") is 
stationary. Then by Lemmas 13,51 13,61 and 13,71 



lim sup 

AT-i-oo 



n?=i/i(YAKti)) 



nUm{u-s)) 



< 2 max limsupP((5Ar) + lim sup 

* N— >oo N->oo 

< 2Mmax||f,;||(l-e- Xls ), 



E, 



nti/iCYjvCti - s)) 



n? =1 /i(Y(^-s)) 



which goes to as s — > 0. Note that Y(i) is stochastically continuous. Ey 
converges to Ey U k =1 fi(Y(ti)) 



as s — > 0. Then we have that lim jy'-foo Ey 



_n?=i/i(Y(ti - s)) 



E, 



nf = i/,(Y(iO) 



□ 



Proof of Theorem \3.2\ Step 1: recall the notation in Section 2.2. Under the homeomorphism, 
the subspace II can be regarded as r(Il) for simplicity. It follows from (|3.6p that for any function 

/on n, 



/(Y fe (t))-/(y) 



(B k f)(Y k (s))ds 



is a martingale, where 



B k f(y) = 2 1 2° ) [f(y + (-1,0))- /(y)J + 9 k y [f(y + (-1, 1)) - /(y) 
+ e k (ik/e k ) 1/a yi (7(y + (i, -i)) - /(y) 



Recall that Y fc (t) = (Y fc °(t), Y k l (t)) and F fc (i) = F fc °(t) + Y£(t). For any function g on J n , let 
/(y) = 5(yo + 2/1 ) for yGlI. Then 

g(Y k (t)) - g(n) - f (B k g)(Y k °(s),Y k 1 (s))ds 



is also a martingale, where B k g{y) = 2y*2\(g(y — 1) — g(y)). Note that I n is a finite set, so the 
discrete topology on I n makes it a complete and compact metric space. Any real valued function 
g on I n is bounded and continuous. Then Y k {-) satisfies the compact containment condition. For 
each T > 0, sup fc / \B k g(Y k °(s),Y k (s))\ds < 2n 2 T\\g\\, where ||g|| = sup J/e/n \g{y)\. By Ethier and 
Kurtz [7J Theorem 9.1 and 9.4, p. 142], Y k (-) is relatively compact in D([0,oo),I n ). 

Step 2: suppose that {^(-)}™ =1 is the sequence of i.i.d. Markov chains taking values in {0,1} 
and whose transition rate matrix is given by 



(i)- "(ft 



1 1 

i 



Let P^(i) = P(£* (t) = (0) = i). A simple calculation shows that 



P k o(t) = I - P \(t) 



(0khk) 1/c 



+ 



ifi k hk 



.I/O 



(^/7fc) 1/a 



-(l+(e fc /7fc) 1/Q )i 



-(l+(e fe /7fc) 1/Q )i 



1 + (V7fc) 1/a 1 + (^/7fc) 1/Q 
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LetCn fe (*) = £-=il 
that for any g on /„ 



«?(*)=o}' 



Since {^f(t)}™ =1 are independent of each other, it is not hard to see 



sup 



E x [g(C*(t))}-V y lg(C(t))] <2n\\g\\e 



t > 0. 



This implies CnW satisfies the ^-mixing condition (see |H P.lll]). By (1.13) of [H p. 109], 

sup |Ej<KC£(i 2 ))<?(C*(*i))] -E 2/ [ ff (^(t 2 ))]E,[ 3 (C^(ti))]| 

2/eJn 

< 2V2^|| 9 || 2 e -( 1+ ^/^) 1/a )(* 2 - 41 )/ 2 
for any i 2 > ii > 0. Then 



g(C(9 k s))-E y [g(C(9 k s))})ds 



E„ 



t i-t 




o •/ o 



J J ds x ds 2 (g& kSl )) - B y [g(C(e kSl ))}) (g(£(9 k s 2 )) - B y [g(C(e k s 2 ))f 
d Sl ds 2 (B y lg(C(e k s 2 ))g(C(e kS ^ 

< C(n)\\gf f ds 2 f e-^{i+{e k h k )^)\s,-sA/2 dsi 

Jo Jo 

< C(n)\\gft/e k , (3.7) 
where C(n) is a constant only depending n. Since Po\(6 k t) — > ^ \/ a and P^Oj-t) — > 1+ \/ a as 



k —> oo, it is easy to see for any t > 0, C^{9 k t) —> Cn as — > oo, where £ n follows the Binomial 



distribution, i.e., ( n ~ Bn(n, ). Note that J ra is finite. The dominated convergence theorem 

shows that 

sup f B y [g(C(d k s))] - E[ 5 (Cn)] ^ -»• 0, 

2/SJn JO 

as A; — >• oo. Combined with (|3.7|) . we have as k — > oo, 



SUP Fly 



g(C*(e kS ))-B[g(C n )])ds 



0. 



(3.8) 



Step 3: (Y k °(t),Y k (t)) is a Markov process as in Step 1. Y k °(0) = y £ I n and Y k (0) = n. Let 
= a{{Y k °(s),Y k {s)) : < s < t}. Define = inf{t > : Y k {t) = n - j} with T fc = and 



PW+i > «) 



E 
E 

E 



P(r| +1 >t 



(K fc °(2j') I rH)( 7 i+l > *) 



E 



V fc °(T fe )l 



(exp{- / hiCLjihs^ds}) 



-E[fc(C»-j)]* 



as A; — >• oo. Similarly 

P^^r* >s) 



E 



1 {r 1 fe >t} E y fe °(T*)( eX p{ - / MC£-l(0k*))<k}) 
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Then 



|P(7f > s,r 2 fc > t) - e -E[ft(C»)]«-E[h(Cn-i)]< 

ft 



< |P(rf > s 



e -E[h(f n )]5| + gup Ey 







MC_i(M)-E[/ i (Cn-i)] )ds 



By (|3.8p and (|3.9p we have that the second term in the right-hand side of the above inequality 

goes to as k — > oo. By induction, (rf , ■ • ■ ,r^_ 1 ) -4- (ti,--- ,r n _i), where {7^}™"* is indepen- 
dent of each other and Tj follows the exponential distribution with parameter c n _j+i. It follows 
that {Yk(t), t > 0} converges in the sense of finite-dimensional distributions to the n-Kingman 
coalescent process {K(t), t > 0}. Since {!&(£)} is relatively compact, the theorem is proved. □ 
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